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Abstract 

Background: The SPARC is a crucial matricellular protein and may influence the course of various diseases like tumor 
metastasis and fibrosis. In the present study, we investigated the association between the potential functional 
polymorphisms in SPARC and coal workers' pneumoconiosis (CWP) risk in a Chinese population. 

Methods: Five potentially functional polymorphisms (rs1059279, rs1059829, rs1053411, rs2304052 and rs4958281) in SPARC 
were genotyped and analyzed in a case-control study including 697 CWP cases and 694 controls. The genotyping was used 
by the TaqMan method with the ABI 7900HT Real Time PCR system. 

Results:Our results revealed that three SNPs (rs1059279, rs1059829, rsl 05341 1) were significantly associated with increased 
risk of CWP under an additive model (OR = 1.35, 95%CI = 1.06-1.71, P = 0.015 for rs1059279; OR=1.20, 95%CI = 1.03-1.39, 
P = 0.021 for rs1059829; OR = 1.31, 95%CI = 1 .03-1 .65, P = 0.025 for rs1053411). In the stratification analysis, significant 
associations were observed between each of these three SNPs and patients with 0-20 pack-years of smoking (OR= 1.73, 
95%CI = 1.21-2.45 for rsl 059279; OR =1.48, 95%CI = 1 .07-2.05 for rsl 05982; OR =1.58, 95%CI = 1 .1 3-2.22 for rsl 053411). 
Furthermore, the association between rsl 059279 and CWP risk remained significant among subjects with over 27 years of 
exposure (OR =1.27, 95%CI = 1.03-1.56, P = 0.023). In the combined analysis of these five polymorphisms, individuals with 
multiple risk alleles had a higher risk of CWP (Ptrend = 0.01 5). 

Conclusion: Our results indicate that three functional SPARC SNPs are associated with an increased risk of CWP in a Chinese 
population. Further functional research and validation studies with diverse populations are warranted to confirm our 
findings. 
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Introduction 

Coal Workers'Pneumoconiosis (CWP) is a serious occupational 
disease derived from inhalation and deposition of occupational 
coal mine dust and/or silica particulates in the lungs. The 
exposure may lead to the aberrant proliferation of activated 
fibroblasts, pathological remodeling, and excessive deposition of 
extracellular matrix (ECM) in lung tissues [1,2]. Currendy, there is 
no effective treatment of CWP. In China, 87.72% of the reported 
occupational cases were attributed to pneumoconiosis in 2013, of 
which CWP (60.28%) and silicosis (34.96%) accounted for the 
majority [3]. The incidence and progression of CWP are related to 
both dust exposure levels and silica content in the dust [4]. 
However, only a portion of individuals exposed to coal dust or 
silica develop CWP in their lifetime, suggesting that genetic 



susceptibility factors also play a role in the development of CWP 
[5]. 

SPARC (secreted protein, acidic and rich in cysteine), also 
known as osteonectin or BM-40, is a 43 kilo-Dalton matricellular 
glycoprotein which is secreted into the extracellular space along 
with other extracellular matrix components, including collagens, 
but does not serve a structural function in the ECM [6] . SPARC 
participates to not only regulate cell-cell and cell-matrix interac- 
tions, but also modify ECM deposition, influence angiogenesis, 
and alter the activity of a number of cytokines and growth factors 
[7] , such as stimulating the transforming growth factor beta (TGF- 
P) signaling system [8]. In addition, SPARC-null mice display a 
lesser amount of pulmonary fibrosis compared with wild type mice 
in animal models of bleomycin-induced pulmonary fibrosis [9]. 
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Table 1. Demographic and selected variables among the CWP cases and control subjects. 





Variables 


CWP(n = 697) 




Controlsfn 


= 694) 


P 




N 


% 


N 


% 




Age, years (mean±SD) 


68.0± 1 1 .1 




67.1 ±8.4 




0.103 


Exposure years (mean±SD) 


26.6±9.0 




27.3±7.8 




0.105 


Smoking status 


Never 


340 


48.8 


360 


51.9 


0.250 


Ever 


357 


51.2 


334 


48.1 




Former 


163 


23.4 


91 


13.1 




Current 


194 


27.8 


243 


35.0 




Pack-years smoked 










<0.001 


0 


340 


49.2 


360 


52.6 




0-20 


223 


32.0 


132 


19.0 




>20 


134 


19.2 


202 


29.1 




Occupation 










0.534 


Tunnel and coal mining 


663 


95.1 


652 


94.0 




Transport 


16 


2.3 


17 


2.5 




Others 


18 


2.6 


25 


3.6 




Stage 


1 


415 


59.5 








II 


219 


31.4 








III 


63 


9.0 











doi:10.1371/journal.pone.0105226.t001 



These points indicate that SPARC may play an important role in 
tissue fibrosis. 

Single nucleotide polymorphisms (SNPs) are the most frequent 
sequence variations in the human genome. The SPARC gene is 
located on human chromosome 5q3 1-32 [10], and there are at 
least 800 SNPs of SPARC reported in the SNP database (http:// 
www.ncbi.nlm.nih.gov/snp). Most of these are located in introns. 
SNPs located in exon regions may alter protein function, whereas 
SNPs in the gene promoter modify gene expression at transcrip- 
tion levels. SNPs in the 3 '-untranslated region (3'-UTR) may be 
regulated by microRNA at translational levels. 

There are a total of ten SNPs in the functional regions of the 
SPARC: one (rs2304052) in the exon, one (rs4958281) in 5'-UTR, 
and eight in 3'-URT with a minor allele frequency (MAF) greater 
than 0.05 in the Chinese population. Therefore, we selected the 
two SNPs located in the exon and 5'-UTR. Among the eight SNPs 



in 3'-UTR, three SNPs (rsl059279, rsl059829 and rsl053411) 
were selected based on a linkage coefficient of r">0.80. To date, 
there is no literature on genetic variants in SPARC and their roles 
in susceptibility to CWP. Therefore, the current study aimed to 
investigate the associations of five functional SNPs in SPARC and 
the susceptibility to CWP in the Chinese population, as well as 
their impact on the progression of the disease. 

Materials and Methods 

Study subjects 

From January 2006 to December 2010, 694 healthy males and 
697 males with definite CWP were recruited from the coal mines 
of Xuzhou Mining Business Group Co., Ltd., as described 
previously [1 1]. All subjects in our study were ethnic Han Chinese 
without a direct family/ genetic relationship. All studied individuals 



Table 2. Primary information of genotyped SNPs. 



Cluster ID Region dbSNP allele MAF HWE b 









dbSNP a 


Case 


Control 




rsl 059279 


3'UTR 


A>C 


0.33 


0.33 


0.30 


0.410 


rs4958281 


5'UTR 


OT 


0.08 


0.04 


0.05 


0.693 


rs2304052 


Exon 


T>C 


0.07 


0.07 


0.08 


0.297 


rsl 059829 


3'UTR 


OT 


0.48 


0.39 


0.35 


1 


rsl 053411 


3'UTR 


OG 


0.33 


0.34 


0.30 


0.929 



dbSNP 3 based on Asian in dbSNP; 

HWE b (Hardy-Weinberg equilibrium) P value of the control group. 
doi:1 0.1 371 /journal.pone.01 05226.t002 
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were selected from the same mines. Subjects were excluded if they 
had clinical evidence of autoimmune diseases, had received 
immunosuppressive or immunostimulatory therapy, or were 
subjected to radiotherapy. High kilovolt chest X-rays and physical 
examinations were performed to confirm diagnoses based on the 
China National Diagnostic Criteria for Pneumoconiosis (GBZ 70- 
2002). These criteria are identical to the 1980 International 
Labour Organization (ILO) in the judgment of opacity profusion 
[4,12]. Two independent physicians (Z. Song and X. Jia) assessed 
the chest X-rays. Pneumoconiosis cases were classified into stage I, 
stage II or stage III according to the size, profusion, and opacity 
distribution range. Controls were recruited from healthy subjects 
who were seeking outpatient physical examinations at the hospital 
department, and were frequency matched on age (±5 years), dust 
exposure period, and occupation. Using a double-blind investiga- 
tion method, live interviewers conducted participant question- 
naires. The epidemiological questionnaire focused on age, 
respiratory symptoms, occupational histories, and smoking habits. 
Five milliliter blood samples were obtained from all subjects and 
used for routine laboratory tests. A written informed consent was 
obtained from each subject before participating in the study. The 
protocol and consent form were approved by the Institutional 
Review Board of Nanjing Medical University (FWA00001501). 

SNPs selection 

To select the most likely functional SNPs influencing the 
SPARC gene, we selected the SNPs located in the exon, 3'-UTR, 
and 5'-UTR, as determined in the HapMap Genome Browser 
release (Phase 1 & 2 release - full dataset). We included the 
following criteria for SNPs: (i) the SNPs should be located in the 
exon, 3'UTR and 5'UTR, (ii) the minor allele frequency (MAF) 
should be >5% in the Chinese Han Beijing population (CHB), 
and (iii) in the case of multiple SNPs in the same haplotype block 
(linkage coefficient r~>0.8 in CHB of 1000 Genome database), 
only the most representative SNP was selected. Ultimately, three 
SNPs (rsl059279, rsl053411, and rsl059829) located in 3'UTR, 
one SNP rs2304052 in the exon, and one SNP rs958281 located in 
5'UTR were included in the study. The LD results of these five 
chosen SNPs demostrated that rs 1059829 had a medium linkage 
disequilibrium with rs 105341 1 (RSquared = 0.786), however, the 
rsl059279 was not included in 1000 Genome database. 

Genotyping 

Genomic DNA was extracted from leukocyte pellet by 
proteinase K digestion and followed by phenol-chloroform 
extraction and ethanol precipitation. Genotyping was performed 
using the TaqMan method with the ABI 7900 real-time PCR 
system (Applied Biosystems, Foster City, CA, USA). SDS allelic 
discrimination software (version 2.4, provided by ABI) was used 
for analysis of genotyping results. 

The sequences of the primers and probes for each SNP are 
available on request. Further detailed information about the 
sequence of each primer and probe is listed in Table SI. 
Amplification was performed in a total volume of 5 jo.1, containing 
50 ng of genomic DNA, 2.5 pi of Mix, 0.25 J_ll of each primer, 
0.125 pi of each probe and 1.25 |xl Nuclease-Free Water, under 
the following conditions: 50°C for 2 minutes and 95°C for 10 
minutes followed by 45 cycles of 95°C for 15 seconds and 60°C for 
1 minute. Negative controls were included in each plate to ensure 
accuracy of genotyping. For quality control, genotyping was done 
in a blinded fashion without knowledge of the workers' personal 
details or case/ control status of subjects. Furthermore, a random 
10% of cases and controls were genotyped twice by different 
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Table 5. Function prediction of SNPs. 



regolumeDB eQTL Dnase TFBS H3K27ac miRNA-LOSS 

rsl 059829 Id Y Y Y Y hsa-miR-541-5p 

rsl 059279 3a N Y Y Y 

rsl 053411 4 N N N Y hsa-miR-431 1 



doi:1 0.1 371 /journal.pone.01 05226.t005 

individuals, with a reproducibility of 100%. All relevant data are 
within Data SI. 

Statistical analyses 

Deviations of the characteristics for CWP patients and control 
subjects were examined by the Student-i test (for continuous 
variables) or the test (for categorical variables). Differences in 
allele and genotype frequencies of the two groups were assessed 
using the Pearson %2 test or Fisher's exact test. Unconditional 
multivariate logistic regression analyses adjusting for age, exposure 
years, pack-years smoked, and occupation, were used to estimate 
ORs and the 95% confidence interval (CI) for assessing the 
strength of association between the polymorphisms in SPARC and 
risk of coal workers' pneumoconiosis under various genetic 
models. These were defined as Aa versus AA and aa versus AA 
for co-dominant, aa+Aa versus AA for dominant, aa versus AA+ 
Aa for recessive, and a versus A for additive model (A: major allele, 
a: minor allele). Hardy-Weinberg equilibrium (HWE) was tested 
using a goodness-of-fit % -test. For the stratified analysis, the dust- 
exposure cut-off used was based on median dust-exposure years of 
the recruited patients and controls. Genotypes were coded as wild 
types (major-allele homozygote) and variants (minor-allele homo- 
zygote and heterozygote). P-values of less than 0.05 were 
considered statistically significant. All statistical tests were two- 
sided and were analyzed with R software (version 3.0). 

Results 

Demographic and clinical information is summarized in 
Table 1. No significant differences were observed for age 
(P = 0.103), exposure years (P = 0.105), or occupation (P = 0.534) 
between CWP patients and controls. The distribution of smoking 
status between cases and controls was parallel (P = 0.250). As 
expected, smoking amount (pack- years) in CWP cases was 
significandy more than that of controls (P<0.001). Pneumoconi- 
osis stages from I to III were identified as 59.5%, 31.4%, and 
9.0%, respectively. 

Details for the SNPs detected in our study are summarized in 
Table 2. All genotyped distributions of control subjects were 
consistent with those expected from the HWE. The MAFs of these 
five polymorphisms were consistent with those reported in the 
HapMap database. 

Furthermore, we performed a multivariate logistic regression 
analysis to assess the effect of each SNP on CWP risk (adjusting for 
age, exposure years, pack-years smoked, and occupation). Param- 
eters for the association of SNPs with CWP are shown in Table 3. 
There are three SNPs (rsl059279, rsl059829, rsl053411) 
significandy associated with the risk of CWP. Analysis under 
different genetic models revealed that the risk allele increased the 
susceptibility to CWP under co-dominant (OR =1.63, 
95%CI= 1.12-2.37, P- 0.011 for CC versus AA for rsl059279; 
OR= 1.56, 95%CI= 1.13-2.17, P = 0.007 for TT versus CC for 
rsl059829; OR = 1.49, 95%CI = 1.04-2.14, P = 0.029 for GG 



versus CC for rsl053411), recessive (OR= 1.53, 95%CI= 1.07- 
2.19, P = 0.021 for rsl059279; OR= 1.52, 95%CI = 1.12-2.06, 
P = 0.007 for rsl059829; OR= 1.43, 95%CI = 1.01-2.01, 
P = 0.041 for rsl053411), and additive (OR= 1.2, 
95%CI= 1.02-1.41, P = 0.027 for rsl059279; OR = 1.20, 
95%CI= 1.03-1.39, P = 0.021 for rsl059829; OR= 1.17, 
95%CI= 1.00-1.38, P = 0.044 for rsl053411) models. These 
associations remained significant after adjusting for age, exposure 
years, pack-years smoked, and occupation, except for the 
correlation between rsl 059829 and CWP risk under an additive 
model. As for rs2304052 and rs4958281, none of the genetic 
models showed any effect of the minor allele on the development 
of CWP. 

The associations between each of these three SNPs (rsl059279, 
rsl059829, rsl053411) and CWP risk were further stratified by 
exposure years and pack-years smoked. As shown in Table 4, the 
association between rsl 059279 and CWP risk remained significant 
among subjects who had greater than 27 years of exposure 
(OR= 1.27, 95%CI= 1.03-1.56, P = 0.023) and individuals with 
0-20 pack-years smoked (OR= 1.73, 95%CI = 1.21-2.45, 
P = 0.002) under an additive model. In addition, the variants 
rsl059829 and rsl053411 both also significandy increased CWP 
risk of individuals with 0-20 pack-years smoked (OR = 1 .48, 
95%CI= 1.07-2.05, P = 0.019 for rsl059829; OR = 1.58, 
95%CI= 1.13-2.22, P = 0.008 for rsl053411) under an additive 
model. The function prediction results of these three SNPs are 
listed in Table 5. 

To deeply evaluate potential interactions of SPARC gene 
polymorphisms on the risk of CWP, we combined five polymor- 
phisms based on the number of variant (risk) alleles (i.e.: 
rsl059279C, rs4958281T, rs2304052C, rsrsl059829T, 
rsrsl05341 1G). As shown in Table 6, individuals with multiple 
risk alleles did have a higher risk of CWP (Ptrend = 0.015). 

Discussion 

SPARC, a matricellular protein secreted from several different 
cell types into the ECM, may modulate interactions between cells 
and the surrounding ECM [6] . The elevated expression of SPARC 
has been reported in animal models of fibrotic disease and in 
human fibrotic tissues, including heart [13], lungs [14], kidneys 
[15] and others. However, in patients with pulmonary fibrosis, the 
expression of SPARC was found to be increased and was localized 
to the cytoplasm of pulmonary fibroblasts [16]. SPARC partici- 
pates in collagen deposition probably through three mechanisms 
in the pathogenesis of pneumoconiosis. First, SPARC might bind 
procollagen and prevent its interaction with cellular receptors, 
such as the discoidin domain receptor (DDR) 2 and integrin oc2 (3 1 
[6] . In the absence of SPARC, procollagen accumulates at the cell 
surface, resulting in less total collagen and fewer thick collagen 
fibers. Second, there is a reciprocal regulatory mechanism between 
SPARC and TGF-fi Blocking of TGF-R signaling by the ALK-5 
inhibitor SB-525334 significantly decreases SPARC expression as 
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well as the degree of fibrosis in vivo [17], whereas decreased 
SPARC expression leads to decreased TGF-P activity [18]. Last, 
SPARC may activate nuclear localization of (3-catenin and 
integrin-linked kinase (ILK) [19]. The activation of P-catenin in 
fibroblasts promotes stabilization of the myofibroblast phenotype 
and an anti-apoptotic phenotype, while the activation of ILK leads 
to ROS production, one of the causative factors of recurrent 
epithelial damage in fibrotic lungs [14,20]. 

Several mouse models have confirmed that SPARC is affiliated 
with pulmonary fibrosis. Savani et al. [21] used bleomycin sulfate 
infused intra-tracheally at 0.15 U/mouse to cause a fibrotic 
response in WT and SPARC-null mice. The outcome revealed 
that SPARC-null mice had increased tissue destruction and 
increased inflammatory cell recruitment, specifically neutrophils, 
in comparison to bleomycin-treated WT mice. These findings 
were consistent with Sangaletti's study [22]. The reasons behind 
the outcome are not readily apparent, but SPARC could be 
produced by both bone marrow-derived and lung fibroblasts, and 
different sources might play a different role [23]. Sangaletti used 
bone marrow chimeric mice and found that expression of SPARC 
in pulmonary fibroblasts promoted collagen deposition, while the 
expression of SPARC in bone marrow cells impeded inflammatory 
infiltrates. This elaborate study demonstrated the intricate 
association between fibrosis and inflammation [23]. 

It is well known that genetic and environmental factors are 
involved in the development of CWP. To our knowledge, this is 
the first evaluation of the association between functional SNPs in 
SPARC and pneumoconiosis susceptibility in a Chinese popula- 
tion. Statistical analyses identified three SNPs (rsl059279, 
rsl059829, rs 105341 1) that were significantly associated with 
pneumoconiosis. In addition, the rs 1059279 was not included in 
1 000 Genome database when we search for LD in SNP selection 
process. However, we found rsl059279 was in high linkage 
disequilibrium with rs 105341 1 (RSquared = 0.904) using our own 
genotyped data. Furthermore, stratification analyses were applied 
and hinted that each of these three SNPs (rsl059279, rsl059829, 
and rs 10534 11) significantly increased CWP risk of individuals 
with 0-20 pack-years smoking. 

Furthermore, we speculate on the function of three SNPs: 
rs 105341 1 might affect the miRNA-LOSS of hsa-miR-431 1, while 
rs 1059829 might affect the miRNA-LOSS of hsa-miR-541-5p, 
which could well be involved in the regulation of mRNA 
production and stability. Moreover, rs 1059829 was a locus of 
expression for Quantitative Trait Loci (eQTL) and Transcription 
Factor Binding Site (TFBS), thus could affect transcription activity 
and even consequendy predispose individuals to excessive 
fibrogenesis. The relevant TFBS include Nficbl, Interferon 
regulatory factor 4 (IRF4), B-cell CLL/lymphoma 3 (BCL3), 
and so on, all of which have innumerable links to the molecular 
mechanisms that result in the transcriptional activation of genes 
responsible for the fibrotic process. These findings set new insights 
into the role of SPARC in the pathogenesis of pneumoconiosis. 

Several limitations of this study should be addressed. First, the 
possibility of selection bias of subjects could not be ruled out in this 
population-based, case-control study. Second, our sample size was 
only moderate, further studies are required to replicate our results 
in larger and more diverse ethnic populations. Third, since five 
SNPs were tested, one might apply an appropriate multiple testing 
correction, such as the Bonferroni correction, otherwise the 
significant association between these three SNPs and CWP risk 
should be interpreted with caution. 

In conclusion, the present study indicates that three functional 
SPARC SNPs are associated with an increased risk of CWP in a 
Chinese population. Further functional research and validation 
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studies with diverse populations are warranted to confirm our 
findings. 

Supporting Information 

Table SI The sequences of the primers and probes for 
each SNP. 

(DOC) 
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